Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

The processing of digitized works

Identifieur interne : 001145 ( Main/Exploration ); précédent : 001144; suivant : 001146

The processing of digitized works

Auteurs : José Borbinha [Portugal] ; Joao Gil [Portugal] ; Gilberto Pedrosa [Portugal] ; Joao Penas [Portugal]

Source :

RBID : Pascal:08-0091670

Descripteurs français

English descriptors

Abstract

This paper describes the processing of digitised works at the National Library of Portugal, as done in the scope of the National Digital Library initiate (BND). This comprises the normalization of the names of the images, the creation of technical metadata, image processing, OCR, indexing, and the creation of derived copies for preservation and copies for access in PNG, JPG, GIF, and PDF. The structural descriptions of all the objects an done in METS.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">The processing of digitized works</title>
<author>
<name sortKey="Borbinha, Jose" sort="Borbinha, Jose" uniqKey="Borbinha J" first="José" last="Borbinha">José Borbinha</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Portugal</country>
<wicri:noRegion>1000-029 Lisboa</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Gil, Joao" sort="Gil, Joao" uniqKey="Gil J" first="Joao" last="Gil">Joao Gil</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Portugal</country>
<wicri:noRegion>1000-029 Lisboa</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Pedrosa, Gilberto" sort="Pedrosa, Gilberto" uniqKey="Pedrosa G" first="Gilberto" last="Pedrosa">Gilberto Pedrosa</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Portugal</country>
<wicri:noRegion>1000-029 Lisboa</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Penas, Joao" sort="Penas, Joao" uniqKey="Penas J" first="Joao" last="Penas">Joao Penas</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Portugal</country>
<wicri:noRegion>1000-029 Lisboa</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">08-0091670</idno>
<date when="2006">2006</date>
<idno type="stanalyst">PASCAL 08-0091670 INIST</idno>
<idno type="RBID">Pascal:08-0091670</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000293</idno>
<idno type="stanalyst">FRANCIS 08-0091670 INIST</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000308</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000491</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000302</idno>
<idno type="wicri:Area/Main/Merge">001163</idno>
<idno type="wicri:Area/Main/Curation">001145</idno>
<idno type="wicri:Area/Main/Exploration">001145</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">The processing of digitized works</title>
<author>
<name sortKey="Borbinha, Jose" sort="Borbinha, Jose" uniqKey="Borbinha J" first="José" last="Borbinha">José Borbinha</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Portugal</country>
<wicri:noRegion>1000-029 Lisboa</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Gil, Joao" sort="Gil, Joao" uniqKey="Gil J" first="Joao" last="Gil">Joao Gil</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Portugal</country>
<wicri:noRegion>1000-029 Lisboa</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Pedrosa, Gilberto" sort="Pedrosa, Gilberto" uniqKey="Pedrosa G" first="Gilberto" last="Pedrosa">Gilberto Pedrosa</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Portugal</country>
<wicri:noRegion>1000-029 Lisboa</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Penas, Joao" sort="Penas, Joao" uniqKey="Penas J" first="Joao" last="Penas">Joao Penas</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Portugal</country>
<wicri:noRegion>1000-029 Lisboa</wicri:noRegion>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Description</term>
<term>Digitizing</term>
<term>Electronic library</term>
<term>Information access</term>
<term>National library</term>
<term>Optical character recognition</term>
<term>Portugal</term>
<term>Preservation</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Portugal</term>
<term>Numérisation</term>
<term>Bibliothèque nationale</term>
<term>Accès information</term>
<term>Bibliothèque électronique</term>
<term>Reconnaissance optique caractère</term>
<term>Préservation</term>
<term>Description</term>
</keywords>
<keywords scheme="Wicri" type="geographic" xml:lang="fr">
<term>Portugal</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Numérisation</term>
<term>Bibliothèque nationale</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This paper describes the processing of digitised works at the National Library of Portugal, as done in the scope of the National Digital Library initiate (BND). This comprises the normalization of the names of the images, the creation of technical metadata, image processing, OCR, indexing, and the creation of derived copies for preservation and copies for access in PNG, JPG, GIF, and PDF. The structural descriptions of all the objects an done in METS.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Portugal</li>
</country>
</list>
<tree>
<country name="Portugal">
<noRegion>
<name sortKey="Borbinha, Jose" sort="Borbinha, Jose" uniqKey="Borbinha J" first="José" last="Borbinha">José Borbinha</name>
</noRegion>
<name sortKey="Gil, Joao" sort="Gil, Joao" uniqKey="Gil J" first="Joao" last="Gil">Joao Gil</name>
<name sortKey="Pedrosa, Gilberto" sort="Pedrosa, Gilberto" uniqKey="Pedrosa G" first="Gilberto" last="Pedrosa">Gilberto Pedrosa</name>
<name sortKey="Penas, Joao" sort="Penas, Joao" uniqKey="Penas J" first="Joao" last="Penas">Joao Penas</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001145 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001145 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:08-0091670
   |texte=   The processing of digitized works
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024